STEWARD: demo of spatio-textual extraction on the web aiding the retrieval of documents
Authors
Abstract
A spatio-textual search engine, termed "STEWARD", is demonstrated where document similarity is based on both the textual similarity as well as the spatial proximity of the locations in the document to the spatial search input. STEWARD's performance is enhanced by the presence of a document tagger that is able to identify textual references to geographical entities. The user interface of STEWARD provides the ability to browse results, thereby making it a valuable "knowledge discovery" tool.
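The abstract does not give STEWARD's ranking formula, so the following is only a minimal sketch of how a score might combine textual similarity with spatial proximity. Everything in it is an assumption made for illustration: the function names, the cosine measure over term counts, the exponential distance decay, the `scale_km` parameter, and the `alpha` weight. It also assumes a document tagger has already turned each document's geographic references into (latitude, longitude) pairs.

```python
import math
from collections import Counter


def cosine_similarity(query_terms, doc_terms):
    """Cosine similarity between two bags of words (assumed textual measure)."""
    q, d = Counter(query_terms), Counter(doc_terms)
    dot = sum(q[t] * d[t] for t in q)
    norm = math.sqrt(sum(v * v for v in q.values())) * math.sqrt(
        sum(v * v for v in d.values())
    )
    return dot / norm if norm else 0.0


def haversine_km(a, b):
    """Great-circle distance in kilometres between two (lat, lon) points."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (
        math.sin((lat2 - lat1) / 2) ** 2
        + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2
    )
    # Clamp to 1.0 to guard against floating-point overshoot near antipodes.
    return 2 * 6371.0 * math.asin(math.sqrt(min(1.0, h)))


def spatial_proximity(query_point, doc_points, scale_km=100.0):
    """Map the distance to the nearest tagged location into (0, 1].

    The exponential decay and scale_km are hypothetical choices.
    """
    if not doc_points:
        return 0.0  # document has no tagged locations
    nearest = min(haversine_km(query_point, p) for p in doc_points)
    return math.exp(-nearest / scale_km)


def combined_score(query_terms, query_point, doc_terms, doc_points, alpha=0.5):
    """Weighted sum of textual and spatial scores (alpha is an assumed weight)."""
    text = cosine_similarity(query_terms, doc_terms)
    space = spatial_proximity(query_point, doc_points)
    return alpha * text + (1 - alpha) * space
```

With identical text, a document whose tagged locations lie closer to the query point receives the higher score:

```python
query_terms = ["flood", "damage"]
query_point = (38.99, -76.94)
doc_terms = ["flood", "damage", "report"]

near = combined_score(query_terms, query_point, doc_terms, [(38.90, -77.04)])
far = combined_score(query_terms, query_point, doc_terms, [(51.51, -0.13)])
print(near > far)
```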
Similar resources
The SPIRIT Spatial Search Engine: Architecture, Ontologies and Spatial Indexing
The SPIRIT search engine provides a test bed for the development of web search technology that is specialised for access to geographical information. Major components include the user interface, geographical ontology, maintenance and retrieval functions for a test collection of web documents, textual and spatial indexes, relevance ranking and metadata extraction. Here we summarise the functiona...
Demo Paper: A Spatio-Temporal-Textual Crime Search Engine
This paper proposes a STT(spatio-temporal-textual) search engine for extracting, indexing, querying and visualizing crime information. Until recently, it’s a labor-intensive work to identify crime entities, cluster similar suspect activities, and discover patterns from massive online collections. It’s a big challenge to reveal inherent ST(spatio-temporal) correlations among mass crime informati...
Extending jCOLIBRI for Textual CBR
This paper summarises our work in textual Case-Based Reasoning within jCOLIBRI. We use Information Extraction techniques to annotate web pages to facilitate semantic retrieval over the web. Similarity matching techniques from CBR are applied to retrieve from these annotated pages. We demonstrate the applicability of these extensions by annotating and retrieving documents on the web.
Health-Domain Image Information Retrieval on the Web from the Perspective of Medical Sciences Experts: A Qualitative Study
Introduction: The medical image as a source of non-textual information has an important role in the field of medicine. Since the quality of life is directly related to health, employing this type of information is effective in improving the practice of health professionals. This study was aimed to survey medical image retrieval in the Web from the perspective of experts in medical sciences. M...
Using Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents
Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...
Journal title:
Volume, issue
Pages -
Publication date: 2007